Abstract. Based on Bergman's Lemma on centralizers, we obtain a sharp
lower degree bound for nonconstant elements in a subalgebra generated
by two elements of a free associative algebra over an arbitrary field.



1. Introduction and the main result 

Let $A_n = K\langle x_1, \dots, x_n\rangle$ be the free associative
algebra of rank $n$ over a field $K$, and let $B$ be a subalgebra of
$A_n$ generated by two elements of $A_n \setminus K$.

Bergman's Lemma on radicals [5] states that if the leading monomial
of an element in a Malcev-Neumann (power series) algebra ( [H El El [7] )
over a field of characteristic 0 has an $r$-th root, then so does the
element itself. Based on this lemma, Makar-Limanov and Yu [8] gave a
sharp lower degree bound for nonconstant elements in $B$ when the
characteristic of $K$ is zero.

However, in positive characteristic the Lemma on radicals is not true,
as the following simple example shows: $x^2 + x$ has no square root in
the Malcev-Neumann (power series) algebra $F((x_1, \dots, x_n))$ in the
free case over a field $F$ of characteristic 2. Therefore, the method
in [5] is no longer applicable.
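The obstruction in this example can already be seen for polynomials in a single variable: in characteristic 2 squaring is additive, so every square involves only even powers of $x$, while $x^2 + x$ contains the odd power $x$. The following Python sketch is only a sanity check under simplifying assumptions that are not in the paper (coefficients in GF(2), one variable, polynomials of degree below 4, and a hypothetical helper `mul_gf2`); it says nothing about the full Malcev-Neumann algebra.

```python
from itertools import product

def mul_gf2(p, q):
    """Multiply two polynomials over GF(2), each given as a set of exponents."""
    out = set()
    for i in p:
        for j in q:
            # Adding the same monomial twice cancels in characteristic 2.
            out ^= {i + j}
    return out

target = {2, 1}  # the polynomial x^2 + x
found_square_root = False
all_squares_even = True

# Enumerate every polynomial of degree < 4 with coefficients in GF(2).
for bits in product([0, 1], repeat=4):
    p = {i for i, b in enumerate(bits) if b}
    square = mul_gf2(p, p)
    if any(e % 2 for e in square):
        all_squares_even = False
    if square == target:
        found_square_root = True

print(all_squares_even, found_square_root)
```

In this small search every square has only even exponents, so none of them equals $x^2 + x$.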

In this paper, based on Bergman's Lemma on centralizers, we generalize
the degree estimate in [8] to any characteristic.

Theorem 1.1. Let $A_n = K\langle x_1, \dots, x_n\rangle$ be a free
associative algebra over a field $K$ and let $f, g \in A_n$ be
algebraically independent elements over $K$. Suppose the leading
monomials $v(f)$ and $v(g)$ are algebraically dependent over $K$, and
neither $\deg(f)$ divides $\deg(g)$ nor $\deg(g)$ divides $\deg(f)$.
Then for any $P(x, y) \in K\langle x, y\rangle \setminus K$,

$$\deg(P(f,g)) \ge \frac{\deg([f,g])}{\deg(f) + \deg(g)}\, w_{\deg(f),\deg(g)}(P(x,y)).$$



2. Proof of the main result 

First we introduce some terminology. Let $K$ be a field of
characteristic $r$ (zero or prime), $A_n$ the free associative algebra
generated by $X = \{x_1, \dots, x_n\}$ over $K$ where $n \ge 2$, and
$F = \langle X \rangle$ the free group generated by $X$. By a group
order we mean a total order of the group as a set that is compatible
with the group operation; namely, if a group $G$ has a group order,
then $G$ is totally ordered as a set, and for any $a, b, c \in G$, if
$a > b$, we always have $ca > cb$ and $ac > bc$. It is possible to
equip $F$ with a group order which extends the partial order given by
the total degree [3], namely if
$\deg(a(x_1, \dots, x_n)) > \deg(b(x_1, \dots, x_n))$ where
$a(x_1, \dots, x_n), b(x_1, \dots, x_n) \in F$, then
$a(x_1, \dots, x_n) > b(x_1, \dots, x_n)$. Under this order,
$K((F))$ forms a Malcev-Neumann algebra [H El E].
Any element $f \in A_n$ can be viewed as an element of $K((F))$. Let
the leading term (namely the least element in the support) of $f$ be
$c \cdot h$ with $c \in K^*$ and $h \in F$; we denote $h$ by $v(f)$
and $c$ by $c(f)$.

For the degree functions, let $\deg$ be the total degree, or
homogeneous degree, of a polynomial in $K((F))$ and let $\deg_{x_i}$
be the partial degree relative to $x_i$. For convenience we restate
the definition of the weighted degree of a polynomial, which has been
defined in [5J E]. The weighted degree
$w_{k_1,\dots,k_n}(m(x_1, \dots, x_n))$ of a monomial $m$ is equal to
$\sum_{i=1}^{n} k_i \cdot \deg_{x_i}(m)$, and for a polynomial
$p(x_1, \dots, x_n)$,
$w_{k_1,\dots,k_n}(p) = \max\{w_{k_1,\dots,k_n}(m) \mid m \in \mathrm{supp}(p)\}$.
Obviously we have $\deg(m) = w_{1,\dots,1}(m)$ and
$\deg_{x_i}(m) = w_{0,\dots,0,1,0,\dots,0}(m)$, where the 1 is the
$i$-th coordinate.
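As an illustration of the weighted degree just defined, the following Python sketch computes $w_{k_1,\dots,k_n}$ for a small noncommutative polynomial. The representation (monomials as tuples of variable names, polynomials as dictionaries) and the function names are assumptions made for this example only, not notation from the paper.

```python
from collections import Counter

# A noncommutative monomial is modelled as a tuple of variable names,
# e.g. ("x", "y", "x") for xyx; a polynomial is a dict mapping each
# monomial in its support to a nonzero coefficient.

def weighted_degree_monomial(monomial, weights):
    """Return sum_i k_i * deg_{x_i}(m) for the monomial m."""
    counts = Counter(monomial)
    return sum(weights[var] * count for var, count in counts.items())

def weighted_degree(poly, weights):
    """Return the maximum weighted degree over the support of a nonzero polynomial."""
    return max(weighted_degree_monomial(m, weights) for m in poly)

# p = xyx - 3*y^2 + 5
p = {("x", "y", "x"): 1, ("y", "y"): -3, (): 5}

total = weighted_degree(p, {"x": 1, "y": 1})  # total degree, w_{1,1}
deg_x = weighted_degree(p, {"x": 1, "y": 0})  # partial degree in x, w_{1,0}
w_2_3 = weighted_degree(p, {"x": 2, "y": 3})  # weighted degree w_{2,3}
print(total, deg_x, w_2_3)
```

Choosing all weights equal to 1 recovers the total degree, and a single weight 1 with the others 0 recovers a partial degree, matching the two special cases noted above.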

Let $f, g \in A_n$ be algebraically independent, where $v(f)$ and
$v(g)$ are algebraically dependent but $\deg(f) \nmid \deg(g)$ and
$\deg(g) \nmid \deg(f)$, and assume that $\deg(g) = n > m = \deg(f)$.

Crucial to the proof of Theorem 1.1 is the following Lemma on
centralizers of Bergman [5], [6].

Lemma 2.1 (on centralizers). Let $R$ be a commutative ring, $S$ an
ordered semigroup (the group order), and $a$ an element of $R((S))$
with invertible leading term $a_u u$. (Thus, $u$ is invertible in $S$,
and $a_u$ in $R$.) Then there exists an element $f$ with leading term
1, such that the element $c = f^{-1} a f$ (which clearly also has
leading term $a_u u$) has support entirely in the centralizer of $u$
in $S$.

We now reproduce the proof of the Lemma on centralizers from [5], [6]
to keep this paper self-contained, as the journal in which [5], [6]
appeared is not widely circulated.

Proof. Clearly, we may assume without loss of generality that
$a_u = 1$. Let $\infty$ be a symbol outside of $S$ with the property
that $s < \infty$ for all $s \in S$, and let $S' = S \cup \{\infty\}$.
Of course $S'$ is a totally ordered set. By 'the leading term of
$r \in R((S))$ is $\alpha t$', we mean that if $r = 0$, then
$t = \infty$ and $\alpha$ is undefined. For each pair $x, y \in S'$,
the intervals of different types are defined as follows:
$[x,y] = \{s \in S' \mid x \le s \le y\}$;
$(x,y] = \{s \in S' \mid x < s \le y\}$;
$[x,y) = \{s \in S' \mid x \le s < y\}$;
$(x,y) = \{s \in S' \mid x < s < y\}$.

For $s, t \in S$, $s$ being invertible, we define
$\frac{t}{s} = \max\{ts^{-1}, s^{-1}t\}$. We also define
$\frac{\infty}{s} = \infty$. It is easy to see that $x > y$ implies
$\frac{x}{s} > \frac{y}{s}$.

Let $X$ be the set of all 3-tuples $(t, b, e)$ where
$t \in (u, \infty]$, $b \in R((S))$ with $v(b) = u$, $c(b) = 1$ and
$\mathrm{supp}(b) \subseteq [u, t) \cap C_u(S)$, and $e$ is an element
with leading term 1 and support in $[1, \frac{t}{u})$ such that
$v(ebe^{-1} - a) = t$, $c(ebe^{-1} - a) = \alpha$ (here we mean that
if $ebe^{-1} - a = 0$, then $t = \infty$, and if not,
$\alpha \in R \setminus \{0\}$).

Now establish a partial order on $X$: $(t, b, e) \le (t', b', e')$ if
and only if $t \le t'$, $\mathrm{supp}(b' - b) \subseteq [t, t')$ and
$\mathrm{supp}(e' - e) \subseteq [\frac{t}{u}, \frac{t'}{u})$ (here
notice that $\frac{t}{u} \le \frac{t'}{u}$, as shown above). The last
two conditions say that $b'$, $e'$ "extend" $b$ and $e$.

$X$ is nonempty since $(v(a - u), u, 1) \in X$. For each ascending
chain $\{(t_i, b_i, e_i) \mid i \in \mathbb{N}^+\}$, we just 'piece
together' the $b_i$ and $e_i$ as $b$ and $e$, and let
$t = v(ebe^{-1} - a)$ (obviously here $t \ge t_i$ for each $i$), and
then $(t, b, e)$ becomes an upper bound of the chain. Hence, according
to Zorn's Lemma, $X$ has a maximal element.

We now prove that if $t < \infty$, then $(t, b, e)$ cannot be a
maximal element. Suppose otherwise, and let $(t, b, e)$ with
$t < \infty$ be a maximal element. There are three cases.

Case 1. $tu^{-1} > u^{-1}t$. Then $\frac{t}{u} = tu^{-1}$. Let
$e' = e - \alpha tu^{-1}$, and hence
$e'^{-1} = e^{-1} + \alpha tu^{-1} + o(tu^{-1})$, where $o(tu^{-1})$
denotes an element of $R((S))$ every element of whose support is
greater than $tu^{-1}$. Let $b' = b$ and
$t' = v(e'b'e'^{-1} - a)$. We have

$$(e - \alpha tu^{-1})\, b\, (e^{-1} + \alpha tu^{-1} + o(tu^{-1})) - a = (ebe^{-1} - a) + \alpha ebtu^{-1} + eb\,o(tu^{-1}) - \alpha tu^{-1}be^{-1} - \alpha^2 tu^{-1}btu^{-1} - \alpha tu^{-1}b\,o(tu^{-1}),$$

and $v(\alpha ebtu^{-1}) = utu^{-1} > t$,
$v(eb\,o(tu^{-1})) > utu^{-1} > t$,
$v(-\alpha^2 tu^{-1}btu^{-1}) = tu^{-1}utu^{-1} = t^2u^{-1} > t$
(notice that $t > u$),
$v(-\alpha tu^{-1}b\,o(tu^{-1})) > tu^{-1}utu^{-1} > t$, and
$v((ebe^{-1} - a) - \alpha tu^{-1}be^{-1}) = v((\alpha t + o(t)) - \alpha tu^{-1}u + o(t)) > t$.
Since the leading term is the least element of the support,

$$v(e'b'e'^{-1} - a) \ge \min\{v((ebe^{-1} - a) - \alpha tu^{-1}be^{-1}),\ v(\alpha ebtu^{-1}),\ v(eb\,o(tu^{-1})),\ v(-\alpha^2 tu^{-1}btu^{-1}),\ v(-\alpha tu^{-1}b\,o(tu^{-1}))\},$$

so $t' > t$. It means that $(t', b', e') > (t, b, e)$, which
contradicts the maximality of $(t, b, e)$.

Case 2. $tu^{-1} < u^{-1}t$. Similarly to Case 1, we let
$e' = e - \alpha u^{-1}t$ and $b' = b$, and obtain
$v(e'b'e'^{-1} - a) > t$.

Case 3. $tu^{-1} = u^{-1}t$. Then $t$ commutes with $u$, so we can let
$e' = e$, $b' = b - \alpha t$, and hence
$e'b'e'^{-1} - a = e(b - \alpha t)e^{-1} - a = (ebe^{-1} - a) - \alpha ete^{-1}$.
Since $ebe^{-1} - a = \alpha t + o(t)$ and $v(\alpha ete^{-1}) = t$,
we get $v((ebe^{-1} - a) - \alpha ete^{-1}) > t$, namely $t' > t$,
which contradicts the maximality of $(t, b, e)$.
Therefore, there must exist some $(t, b, e)$ such that $t = \infty$,
namely $ebe^{-1} = a$, or $e^{-1}ae = b$. □

Let us give an example in $K((F))$ to illustrate Bergman's Lemma on
centralizers and its proof. Here we use the opposite convention for
"well-ordered" on $F$, namely each subset has a greatest element.

Example 2.2. In $F$ we assume $x > y$ and
$xy \cdot (x^2)^{-1} < (x^2)^{-1} \cdot xy$ (of course
$xy \cdot (x^2)^{-1} > (x^2)^{-1} \cdot xy$ is also feasible, since
both are total orders extending the partial order of degree) and let
$a = x^2 + xy$. By Bergman's method, we start the approximation from
$(xy, x^2, 1)$ (that is, $b = v(a)$, $t = v(a - v(a))$, $e = 1$). Then
$e' = e + xy \cdot (x^2)^{-1} = 1 + xy \cdot (x^2)^{-1}$ and
$(e')^{-1} = e^{-1} - xy \cdot (x^2)^{-1} + o(xy \cdot (x^2)^{-1}) = 1 - xy \cdot (x^2)^{-1} + o(xy \cdot (x^2)^{-1})$,
where $o(xy \cdot (x^2)^{-1})$ means that all the remaining monomials
are less than $xy \cdot (x^2)^{-1}$. We have $b' = b = x^2$, and since
$e'b'(e')^{-1} = (1 + xy \cdot (x^2)^{-1})\, x^2\, (1 - xy \cdot (x^2)^{-1} + o(xy \cdot (x^2)^{-1}))$,
it is easy to get that
$v(e'b'(e')^{-1} - a) = x^2 \cdot xy \cdot (x^2)^{-1}$ since $x > y$,
namely $t' = x^2 \cdot xy \cdot (x^2)^{-1}$.

After $k$ steps, we get the three-tuple $(t_k, b_k, e_k)$. Now we
claim that for all the $t_i$, if $t_i \ne \infty$, then
$\deg(t_i) = 2$, and all the $e_i$ are homogeneous of degree 0 and
$b_i = x^2$ throughout. For $k = 1$, we see
$t_1 = x^2 \cdot xy \cdot (x^2)^{-1}$,
$e_1 = 1 + xy \cdot (x^2)^{-1}$, $b_1 = x^2$, and the claim holds.
Assume that it is correct for $k = n - 1$. If $t_{n-1} = \infty$, then
$e_{n-1} b_{n-1} e_{n-1}^{-1} = a$, and we are done. If not, since
$t_{n-1}$ is a monomial of degree 2 which is less than $x^2$, it
cannot commute with $x^2$ (by Bergman [1], the centralizer of any
element of $K\langle x_1, \dots, x_n\rangle \setminus K$ is a
polynomial algebra in one variable over $K$). Hence
$b_n = b_{n-1} = x^2$, and $e_n = e_{n-1} + \alpha t_{n-1} x^{-2}$ or
$e_n = e_{n-1} + \alpha x^{-2} t_{n-1}$, so the new term of $e_n$
always has degree 0. Then $e_n$ is also homogeneous of degree 0 and so
is $e_n^{-1}$. Obviously $e_n b_n e_n^{-1}$ is homogeneous of degree 2,
and since $a$ is homogeneous of degree 2, $e_n b_n e_n^{-1} - a$ is
homogeneous of degree 2 or equal to 0, namely $\deg(t_n) = 2$ or
$t_n = \infty$.

It means that after finitely many steps of the algorithm, we always
get $eae^{-1} = x^2 + t$ where $\deg(t) = 2$, or we get
$eae^{-1} = x^2$. Now we consider the subset $S$ of three-tuples
$(t, b, e)$ defined in the proof of the Lemma on centralizers for
which $e$ is homogeneous of degree 0 and $b = x^2$. Since $a$ is
homogeneous of degree 2, $t$ is also of degree 2 or $\infty$. Then,
keeping the order introduced by Bergman on $S$, if $t$ is not
$\infty$, we can always construct an 'extension' of $b$ and $e$ such
that $(t', b', e') \in S$ is greater. However, by 'piecing together',
we always get a maximal element, and hence we get a maximal element
with $t = \infty$, namely there exists an $e$ which is homogeneous of
degree 0 such that $eae^{-1} = x^2$.

Remark 2.3. Each step in Bergman's proof constructs a better
approximation to the maximal element, rather than computing the
maximal three-tuple directly.

According to the discussion in the example above, we obtain 

Proposition 2.4. If an element $a \in K((F))$ is homogeneous, then
there exists some $e \in K((F))$ with leading term 1 which is
homogeneous of degree 0 such that $eae^{-1} = c(a)v(a)$.

□ 

Then according to the Lemma on centralizers, there exists some
$t \in K((F))$ with $c(t)v(t) = 1$ such that the support of
$tft^{-1}$ is in $C_F(v(f))$. Let $v(f) = h^q$ where $h$ is the
generator of $C_F(h)$, and then
$tft^{-1} = \sum_{i=-\infty}^{q} a_i h^i$ with $a_i \in K$. Let
$f' = tft^{-1}$, $g' = tgt^{-1}$, and we have the following

Lemma 2.5. For any $P(x, y) \in K\langle x, y\rangle$, $P(f', g') = tP(f, g)t^{-1}$.

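Proof. Let $P(x,y) = \sum_{i=0}^{m}\sum_{j=0}^{n} a_{ij} x^i y^j$ for
some nonnegative integers $m$ and $n$, where $a_{ij} \in K$. Then

$$P(f', g') = \sum_{i=0}^{m}\sum_{j=0}^{n} a_{ij} (tft^{-1})^i (tgt^{-1})^j = \sum_{i=0}^{m}\sum_{j=0}^{n} a_{ij}\, t f^i g^j t^{-1} = tP(f,g)t^{-1}.$$

The same computation applies monomial by monomial to a general element
of $K\langle x, y\rangle$, since conjugation by $t$ is multiplicative.

Since $v(t) = 1$, $\deg(P(f, g)) = \deg(P(f', g'))$, where the degree
function is the homogeneous degree of $K((F))$. So we can just do the
degree estimate for $P(f', g')$.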

Two elements of $A_n$ are called algebraically independent over $K$ if
they generate a free subalgebra of rank two. If $v(f)$ and $v(g)$ are
algebraically independent, then for all
$P(x,y) \in K\langle x, y\rangle \setminus K$,
$\deg(P(f, g)) = w_{\deg(f),\deg(g)}(P(x,y))$, so we may assume
without loss of generality that $v(f)$ and $v(g)$ are algebraically
dependent. However, if $\deg(f) \mid \deg(g)$ or
$\deg(g) \mid \deg(f)$, then $\deg(f) + \deg(g)$ can be reduced by
some automorphism, so we also assume $\deg(f) \nmid \deg(g)$ as well
as $\deg(g) \nmid \deg(f)$. We assume that $f$ and $g$ are
algebraically independent over $K$ but $v(f)$ and $v(g)$ are not.
Hence, since $v(f') = v(f)$ and $v(g') = v(g)$, $f'$ and $g'$ are
algebraically independent but $v(f')$ and $v(g')$ are algebraically
dependent. Then since $h$ generates its own centralizer in $A_n$,
$v(g') = h^p$ for some positive integer $p$. Let $g' = h^p + g'_1$
where $v(g'_1) < h^p$, and if $v(g'_1)$ and $h$ are dependent, then
$v(g'_1) = h^{p_1}$ for some integer $p_1$ which is less than $p$.
This can be done inductively.

Lemma 2.6 (on steps). The above process will stop after a finite
number of steps.

Proof. After $k$ steps, let $g' = \sum_{i=1}^{k} a_i h^{m_i} + g'_k$.
Obviously
$\deg([f', g']) = \deg([f', g'_k]) \le \deg(f') + \deg(g'_k)$, so
$\deg(g'_k) \ge \deg([f', g']) - \deg(f') = \deg([f, g]) - \deg(f)$.
Here notice that $\deg(h) > 0$, so after each step, if a further step
is possible, $\deg(g'_k)$ decreases by at least 1, which means that
after at most

$$\deg(g) - (\deg([f,g]) - \deg(f)) = \deg(fg) - \deg([f,g])$$

steps, the process will stop. □

Hence, after a finite number of steps we get
$g' = \sum_{i=p-k}^{p} a_i h^i + s$ where $v(s)$ and $h$ are
algebraically independent.

Let $C$ be the subalgebra generated by $h$, $h^{-1}$ and $s$, and
equip it with the weighted degree function $w_{1,p}$ where
$w_{1,p}(h) = 1$ and $w_{1,p}(s) = p$.

Of course $f', g' \in C$, and we write $\bar{f}'$, $\bar{g}'$ for the
leading parts of $f'$ and $g'$ respectively relative to $w_{1,p}$. For
any polynomial $P(x,y)$, let $\bar{P}$ denote the leading part
relative to the weighted degree function $w_{q,p}$. Let $\deg$ be the
homogeneous degree of $A_n$, and we have:

Lemma 2.7 (on degrees). $\bar{P}(\bar{f}', \bar{g}') \ne 0$ and

$$\deg(P(f', g')) \ge \deg(\bar{P}(\bar{f}', \bar{g}')).$$

Proof. Consider $P(f', g') = Q(h, h^{-1}, s)$ as well as
$\bar{P}(\bar{f}', \bar{g}') = R(h, h^{-1}, s)$ as elements of $C$;
then $R$ is the leading part of $Q$ relative to $w_{1,p}$, so all the
monomials of $R$ appear in $Q$ with nonzero coefficients. Since $h$
and $v(s)$ are algebraically independent,
$\deg(P(f', g')) = w_{\deg(h),\deg(s)}(Q(h, h^{-1}, s))$ and
$\deg(\bar{P}(\bar{f}', \bar{g}')) = w_{\deg(h),\deg(s)}(R(h, h^{-1}, s))$.
We conclude by the definition of weighted degree. □

Now we only need to estimate $\deg(\bar{P}(\bar{f}', \bar{g}'))$.

The following procedure is similar to its counterpart in [8].

Now we can write $f = t^m$ and $g = t^n + s$ just for convenience,
since $\deg(f) = m$ and $\deg(g) = n$. Then $\deg(t) = 1$ and for each
polynomial $m(x,y)$,
$\deg(m(t, s)) = w_{\deg(t),\deg(s)}(m(x,y))$, or we can say that
$v(t)$ and $v(s)$ are algebraically independent over $K$.

Let $N = w_{m,n}(P(x,y))$, and let $q$ be the greatest integer not
exceeding $\frac{N}{m+n}$ (that is, $q = \left[\frac{N}{m+n}\right]$).
Define $Q(t, s) = P(t^m, t^n + s)$, and we have

Lemma 2.8 (on monomials). There is a monomial $u(t, s)$ in
$\mathrm{supp}(Q)$ such that $\deg_s(u) \le q$.
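The statement can be checked on a small instance. The Python sketch below treats $t$ and $s$ as free noncommuting symbols, takes $m = 2$, $n = 3$ and $P = x^3 - y^2$ (all three choices, the dictionary representation of polynomials, and the helper names are assumptions made for this illustration), expands $Q(t,s) = P(t^m, t^n + s)$, and compares the least $s$-degree in $\mathrm{supp}(Q)$ with $q$. It is a sanity check on one example, not a proof.

```python
def mul(p, q):
    """Product of two noncommutative polynomials (dict: word tuple -> coefficient)."""
    out = {}
    for a, ca in p.items():
        for b, cb in q.items():
            out[a + b] = out.get(a + b, 0) + ca * cb
    return {w: c for w, c in out.items() if c != 0}

def add(p, q):
    """Sum of two noncommutative polynomials, dropping cancelled terms."""
    out = dict(p)
    for w, c in q.items():
        out[w] = out.get(w, 0) + c
    return {w: c for w, c in out.items() if c != 0}

def substitute(P, image_x, image_y):
    """Evaluate P(x, y) at x = image_x, y = image_y."""
    result = {}
    for word, coeff in P.items():
        term = {(): coeff}
        for letter in word:
            term = mul(term, image_x if letter == "x" else image_y)
        result = add(result, term)
    return result

def weight(P, wx, wy):
    """Weighted degree w_{wx,wy}(P) of a nonzero polynomial in x and y."""
    return max(wx * w.count("x") + wy * w.count("y") for w in P)

m, n = 2, 3
P = {("x", "x", "x"): 1, ("y", "y"): -1}      # P = x^3 - y^2
t_m = {("t",) * m: 1}                          # image of x: t^m
t_n_plus_s = {("t",) * n: 1, ("s",): 1}        # image of y: t^n + s

Q = substitute(P, t_m, t_n_plus_s)
N = weight(P, m, n)
q = N // (m + n)
min_deg_s = min(w.count("s") for w in Q)
print(sorted(Q.items()), N, q, min_deg_s)
```

Here the two copies of $t^6$ cancel, and the surviving monomials of $Q$ include ones with a single $s$, in line with the bound $\deg_s(u) \le q$.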

Proof. Choose a monomial $z(x,y)$ in $\mathrm{supp}(P(x, y))$ such
that

(1) $\deg_y(z)$ is the greatest;

(2) Among all the monomials whose degree with respect to $y$ is equal
to $\deg_y(z)$, $z$ is the greatest under the lexicographic order
$x \gg y$.

Let $z(x,y) = x^{\alpha_1} y^{\beta_1} \cdots x^{\alpha_k} y^{\beta_k}$
with $\alpha_1, \beta_k \ge 0$ and $\alpha_i \ge 1$, $2 \le i \le k$,
$\beta_j \ge 1$, $1 \le j \le k - 1$. Let $I = \deg_x(z)$ and
$J = \deg_y(z)$. If $J \le q$, then the degrees with respect to $s$ of
all the monomials in $\mathrm{supp}(Q)$ are not greater than $q$, and
since in Proposition 2.4 it is proved that $\mathrm{supp}(Q)$ is not
empty, the lemma follows. Hence assume $J > q$, that is,
$J \ge q + 1$. If $I + J \ge 2q + 2$, then since $N = mI + nJ$,
$N = m(I + J) + (n - m)J \ge m(2q + 2) + (n - m)(q + 1) = (m + n)(q + 1)$,
which contradicts $\frac{N}{m+n} < q + 1$, and hence
$I + J \le 2q + 1$.

Now for $z(x, y) = x^{\alpha_1} y^{\beta_1} \cdots x^{\alpha_k} y^{\beta_k}$,
replace $x$ by $t^m$, and, if $\beta_i = 2\sigma_i$, replace
$y^{\beta_i}$ by $(st^n)^{\sigma_i}$; if $\beta_i = 2\sigma_i + 1$,
replace $y^{\beta_i}$ by $(st^n)^{\sigma_i}s$. Then we get a monomial
$u(t, s)$. It is easy to verify that $u(t, s)$ is a monomial in the
expansion of
$z(t^m, t^n + s) = t^{m\alpha_1}(t^n + s)^{\beta_1} \cdots t^{m\alpha_k}(t^n + s)^{\beta_k}$,
and its coefficient in this expansion is just the coefficient of $z$
in $P$ and hence nonzero.

Now we are going to prove that $u(t, s)$ cannot come from the
expansions of other monomials in $\mathrm{supp}(P)$ after the
replacement.

We first divide $z(x,y)$ into parts: $x^{\alpha_1}$;
$y^{\beta_i} x^{\alpha_{i+1}}$ where $1 \le i \le k - 1$;
$y^{\beta_k}$. Let $l(x, y)$ be a part of $z(x, y)$, and define
$\psi(l(x, y))$ to be the corresponding part of $u(s,t)$ after the
replacement. So $\psi(x^{\alpha_1}) = t^{m\alpha_1}$ and so on. If
$u(s,t)$ is also in the expansion of $z_1(t^m, t^n + s)$ where
$z_1(x,y) \in \mathrm{supp}(P(x,y))$, then let $l_1(x,y)$ be a part of
$z_1(x, y)$, and define $\psi_1(l_1(x,y))$ to be the corresponding
part of $u(s,t)$. Hence $z_1(x,y)$ can also be divided into
$\prod_{i=1}^{k+1} h_i(x,y)$ with $\psi_1(h_1) = \psi(x^{\alpha_1})$,
$\psi_1(h_{i+1}) = \psi(y^{\beta_i} x^{\alpha_{i+1}})$ where
$1 \le i \le k - 1$, and $\psi_1(h_{k+1}) = \psi(y^{\beta_k})$.
Obviously $\deg_y(h_1) \ge \deg_y(x^{\alpha_1})$.

For each $i$, $1 \le i \le k - 1$, if $\beta_i$ is odd, then
$\psi_1(h_{i+1}) = (st^n)^{\sigma_i} s \cdot t^{m\alpha_{i+1}}$, and
since $m \nmid n$, the $t^n$ between two $s$ has to come from $g$, so
$h_{i+1} = y^{\beta_i} h'_{i+1}$ where
$\psi_1(h'_{i+1}) = t^{m\alpha_{i+1}}$, namely
$\deg_y(h_{i+1}) \ge \beta_i$. If $\beta_i$ is even, then
$\psi_1(h_{i+1}) = (st^n)^{\sigma_i} \cdot t^{m\alpha_{i+1}} = (st^n)^{\sigma_i - 1} s \cdot t^{m\alpha_{i+1} + n}$.
Hence $h_{i+1} = y^{\beta_i - 1} h'_{i+1}$ where
$\psi_1(h'_{i+1}) = t^{m\alpha_{i+1} + n}$. However, since
$m \nmid n$, $h'_{i+1}$ cannot be of the form $x^p$ for some integer
$p$, and hence $\deg_y(h'_{i+1}) \ge 1$, namely
$\deg_y(h_{i+1}) \ge \beta_i$. For $h_{k+1}$, since
$\psi_1(h_{k+1}) = \psi(y^{\beta_k}) = st^n st^n \cdots s$ or
$st^n st^n \cdots st^n$, it has to be equal to $y^{\beta_k}$. Hence

$$\deg_y(z_1(x,y)) = \sum_{i=1}^{k+1} \deg_y(h_i) \ge \sum_{i=1}^{k} \beta_i = \deg_y(z(x,y)).$$

However, $\deg_y(z(x, y))$ is the greatest among the monomials in
$\mathrm{supp}(P)$, so $\deg_y(z_1(x,y)) = \deg_y(z(x, y))$, and the
only possibility is that $h_1(x,y) = x^{\alpha_1}$, and for
$1 \le i \le k - 1$, $h'_{i+1} = x^{\alpha_{i+1}}$ if $\beta_i$ is odd
and $\deg_y(h'_{i+1}) = 1$ if $\beta_i$ is even. Let $h'_{j+1}$ be the
monomial with least $j$ such that $\beta_j$ is even but
$h'_{j+1} \ne yx^{\alpha_{j+1}}$; then since
$\deg_y(h'_{j+1}) = 1$, $h'_{j+1} = x^r y x^{\alpha_{j+1} - r}$ for
some $1 \le r \le \alpha_{j+1}$. But if so, $z_1(x,y) > z(x,y)$ under
the lexicographic order $x \gg y$, which contradicts $z(x, y)$ being
maximal; hence no such $h'_{j+1}$ exists, namely each $h'_{j+1}$ of
this kind is equal to $yx^{\alpha_{j+1}}$. Hence $z_1(x,y) = z(x,y)$
and the coefficient of $u(s,t)$ is not zero.

According to the definition of $u(s,t)$, we see that

$$\deg_s(u) = \sum_{i=1}^{k} \left[\frac{\beta_i + 1}{2}\right].$$

Obviously $I \ge k - 1$, and hence
$\deg_s(u) \le \frac{J + k}{2} \le \frac{I + J + 1}{2} \le \frac{2q + 2}{2} = q + 1$
(recall that $I + J \le 2q + 1$). Notice that $\deg_s(u) = q + 1$ only
if all the $\beta_i$ are odd and $I = k - 1$, and
$z(x,y) = y^{2\sigma_1 + 1} x y^{2\sigma_2 + 1} \cdots x y^{2\sigma_k + 1}$
or
$y^{2\sigma_1 + 1} x y^{2\sigma_2 + 1} \cdots x y^{2\sigma_{k-1} + 1} x$.
Then in $z(t^m, t^n + s)$ we replace $y^{2\sigma_1 + 1} x$ by
$(t^n s)^{\sigma_1} t^n \cdot t^m$ and choose
$u(t, s) = (t^n s)^{\sigma_1} t^n t^m (st^n)^{\sigma_2} s \cdots$. We
denote $z(x,y) = y^{2\sigma_1 + 1} x \cdot h(x,y)$, and if $u(s,t)$
can also come from another monomial $z_1(x,y)$, then
$z_1(x,y) = y^{2\sigma_1} h_1(x,y) h(x,y)$ with
$\psi_1(h_1) = t^{m+n}$. Hence $h_1(x,y) = xy$ or $yx$. Notice again
that $z(x,y)$ is the maximal element under the lexicographic order
$x \gg y$, and hence $h_1(x,y) = yx$, which means
$z_1(x,y) = z(x,y)$. Then the coefficient of $u(s, t)$ is nonzero and
$\deg_s(u) = q + 1 - 1 = q$. □



Proof of Theorem 1.1. Recall that $\deg(f) = m$, $\deg(g) = n$,
$\deg(t) = 1$,
$\deg(s) = \deg([f,g]) - \deg(f) = \deg([f,g]) - m$, and
$N = w_{m,n}(P(x,y))$. We have proved that there exists some
$u(s,t) \in \mathrm{supp}(P(t^m, t^n + s))$ such that
$\deg_s(u) \le N/(m + n)$. Since
$N = \deg_t(u) + n \cdot \deg_s(u)$, we have
$\deg(u) = \deg_t(u(t, s)) + \deg_s(u(t, s)) \cdot (\deg([f,g]) - m) = N + \deg_s(u(s,t))(\deg([f,g]) - m - n)$.
Since $\deg([f, g]) - m - n < 0$, we get

$$\deg(P(f,g)) \ge \deg(\bar{P}(\bar{f}', \bar{g}')) \ge \deg(u) \ge N + \frac{N}{m+n}(\deg([f,g]) - m - n) = \frac{\deg([f,g])}{m+n}\, w_{m,n}(P).$$

Since $m + n = \deg(fg)$, we get

$$\deg(P(f,g)) \ge \frac{\deg([f,g])}{\deg(fg)}\, w_{\deg(f),\deg(g)}(P).$$

□



Example 2.9. Let $f = x^n$, $g = x^m + y$, $P = [x, y]^k$. Then

$$\deg(P(f,g)) = k(n + 1) = \frac{\deg([f,g])}{\deg(fg)}\, w_{\deg(f),\deg(g)}(P),$$

which shows the estimate is sharp.
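The equality in this example can be verified mechanically for specific exponents. The Python sketch below does so for $n = 2$, $m = 3$, $k = 2$; these values, the dictionary representation of noncommutative polynomials, and the helper names are assumptions chosen for illustration.

```python
from fractions import Fraction

def mul(p, q):
    """Product of two noncommutative polynomials (dict: word tuple -> coefficient)."""
    out = {}
    for a, ca in p.items():
        for b, cb in q.items():
            out[a + b] = out.get(a + b, 0) + ca * cb
    return {w: c for w, c in out.items() if c != 0}

def sub(p, q):
    """Difference p - q, dropping cancelled terms."""
    out = dict(p)
    for w, c in q.items():
        out[w] = out.get(w, 0) - c
    return {w: c for w, c in out.items() if c != 0}

def commutator(p, q):
    """[p, q] = pq - qp."""
    return sub(mul(p, q), mul(q, p))

def power(p, k):
    """k-th power of p for k >= 1."""
    result = p
    for _ in range(k - 1):
        result = mul(result, p)
    return result

def degree(p):
    """Total degree of a nonzero polynomial."""
    return max(len(w) for w in p)

n, m, k = 2, 3, 2
f = {("x",) * n: 1}                    # f = x^n
g = {("x",) * m: 1, ("y",): 1}         # g = x^m + y

bracket = commutator(f, g)             # [f, g] = x^n y - y x^n
lhs = degree(power(bracket, k))        # deg P(f, g) for P = [x, y]^k

# P = [x, y]^k as a polynomial in x and y, weighted by (deg f, deg g) = (n, m).
P = power(commutator({("x",): 1}, {("y",): 1}), k)
w = max(n * word.count("x") + m * word.count("y") for word in P)

rhs = Fraction(degree(bracket), n + m) * w
print(lhs, rhs)
```

For these values both sides equal $k(n+1) = 6$, so the lower bound of Theorem 1.1 is attained.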

Remark 2.10. The methodology in this paper, unlike that in [8], is
not applicable to the commutative case, as in that case there is no
invariant to decide whether two polynomials are algebraically
dependent or independent over a field of positive characteristic.
Finding such an invariant is an interesting question, and it would
also be interesting to obtain a sharp degree estimate for the
commutative case in positive characteristic.